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Abstract 



We revisit affine diffusion processes on generai and on the canonicai state space in partic- 
ular. A detailed study of theoretic and applied aspects of this class of Markov processes is 
given. In particular, we derive admissibility conditions and provide a full proof of existence 
and uniqueness through stochastic invariance of the canonical state space. Existence of expo- 
nential moments and the full range of validity of the affine transform formula are established. 
This is applied to the pricing of bond and stock options, which is illustrated for the Vasicek, 
Cox-Ingersoll-Ross and Heston models. 

1 Introduction 

Affine Markov models have been employed in finance since decades, and they have found 
growing interest due to their computational tractability as well as their capability to capture 
empirical evidence from financial time series. Their main applications lie in the theory of term 
structure of interest rates, stochastic volatility option pricing and the modeling of credit risk 
(see [T2] and the references therein). There is a vast literature on affine models. We mention 
here explicitly just the few articles [2J gl |51 [ISl UHl HHISJ HH1 [551 [ST] [25] and [12] for a broader 



In this paper, we revisit the class of affine diffusion processes on subsets of R and on the 
canonical state space x R", in particular. In Section [2] we first provide necessary and 
sufficient conditions on the parameters of a diffusion process X to satisfy the affine transform 
formula 



The functions <f> and tp in turn are given as solutions of a system of coupled Riccati equations. 
Arguing by stochastic invariance, in Section [3] we can further restrict the choice of admissible 
diffusion parameters. 

Glasserman and Kim [16] showed recently that the affine transform formula holds whenever 
either side is well defined under the assumption of strict mean reversion. This is an extension 
of the findings in [12| . where only sufficient conditions are given in terms of analyticity of 
the right hand side. The strict mean reversion assumption, however, excludes the Heston 
stochastic volatility model. In our paper, we show that strict mean reversion is not needed 
(Theorem 13. 3[) . As a by product, we obtain some non-trivial convexity results for Riccati 
equations. Having the full range of validity of the above transform formula under control, in 
Section 2] we can then proceed to pricing bond and stock options in affine models. Particular 
examples are the Vasicek and Cox-Ingersoll-Ross (CIR) short rate models in Section [5] and 
Heston's stochastic volatility model in Section [6] 

The representation of affine short rate models bears some ambiguity with respect to linear 
transformations of the state process. This motivates the question whether there exists a clas- 
sification method ensuring that affine short rate models with the same observable implications 
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have a unique canonical representation. This topic has been addressed in 10 , 9 , 24 8 . In Scc- 
tion[71 we recap this issue and show that the diffusion matrix of X can always be brought into 
block-diagonal form by a regular linear transform leaving the canonical state space invariant. 

The existence and uniqueness question of the relevant stochastic differential equation is 
completely solved through stochastic invariance and the block-diagonal transformation in 
Section [8] The presented proof builds on the seminal result by Yamada and Watanabe [35] . 
We therefore approach the existence issue differently from [T2] which uses infinite divisibility 
on the canonical state space and the Markov semigroup theory. 

In the appendix, we provide some self contained proofs of existence and comparison state- 
ments for relevant systems of Riccati equations (Section [B}. Moreover, some moment lemmas 
from [12] in a more elaborated fashion can be found in Section 1X1 



2 Definition and Characterization of Affine Processes 

Fix a dimension d > 1 and a closed state space X C R d with non-empty interior. We let 
b : X - 
matrix 



b : X — ► R d be continuous, and p : X — * R axa be measurable and such that the diffusion 



a(x) = p(x)p(x) 

is continuous in x € X. Let W denote a d-dimensional Brownian motion defined on a filtered 
probability space (fi, J-, (Ft),P). Throughout, we assume that for every x £ X there exists a 
unique solution X — X x of the stochastic differential equation 

dX(t)=b(X(t))dt + p(X(t))dW(t), X(0) = x. (2.1) 

Definition 2.1. We call X affine if the Ft-conditional characteristic function of X(T) is 
exponential affine in X(t), for all t < T. That is, there exist C- and C d - valued functions 
4>{t,u) and ip(t,u), respectively, with jointly continuous i-derivatives such that X — X x 

S c\ l" lSllGS 

E [e 1,Tx(T) | .Ft] = e'* (T-t ' ,l)+ ' / ' (T ~ t ' u)Tx(i) (2.2) 

for all u £ m d , t<T and x £ X. 

Since the conditional characteristic function is bounded by one, the real part of the expo- 
nent 4>{T — t,u)+ ip(T — t, u) T X(t) in (|2.2|l has to be negative. Note that <j>(t,u) and ip(t,u) 
for t > and it £ iR d are uniqueljQ determined by (|2.2|l . and satisfy the initial conditions 
0(0, u) — and tp(0,u) = it, in particular. 

We first derive necessary and sufficient conditions for X to be affine. 

Theorem 2.2. Suppose X is affine. Then the diffusion matrix a(x) and drift b(x) are affine 
in x. That is, 

d 

a(x) = a + 22 x i a i 



i = i 

d 



(2.3) 



b(x) = b + J2 Xif3i = b + Bx 
i=i 

for some d x d-matrices a and an, and d-vectors b and f5i, where we denote by 

B = 09i 1 ... 1 /9 d ) 

the d x d-matrix with i-th column vector (3i, 1 < i < d. Moreover, (f> and ip = (ipi, . . . ,ipd) T 
solve the system of Riccati equations 

dt4>{t, it) = o 1 ^^' U ) T a Tp(f> u ) + b T, ip(t, it) 



<K0, u) = 

i>i{t,u) = 

tjj(0, u) = u. 



d t ipi{t, u) = -V'C*! u) T cti ip(t, u) + f3jip(t, u), l<i<d, 



(2.4) 



1 In fact, 4>(t, u) may be altered by multiples of 2iri. We uniquely fix the continuous function <j>{t, u) by the initial 
condition 0(0, u) = 0. 
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In particular, <j> is determined by if) via simple integration: 

4>(t,u) = J ^^ip(s,u) T aip(s,u) + b T \[)(s,u)J ds. 

Conversely, suppose the diffusion matrix a(x) and drift b(x) are affine of the form (|2.3[) and 
suppose there exists a solution (<^>, of the Riccati equations (|2.4[) such that (f)(t,u)+ip(t,u) T x 
has negative real part for all t > 0, u G iR d and x G X . Then X is affine with conditional 
characteristic function (|2.2[) . 

Proof. Suppose X is affine. For T > and u G iR d define the complex- valued Ito process 

]tfU\ _ e 4>{T-t,u)+4>(T-t,u) T X(t) 

We can apply Ito's formula, separately to real and imaginary part of M, and obtain 
dM(t) = I(t) dt + iP{T - t, u) T p(X(t)) dW{t), t < T, 

with 

I(t) = -d T <t>(T -t,u)- d T ip{T - t, u) T X(t) 

+ ip(T - t,u) T b(X(t)) + i-i/)(T - t, u) T a(X(t)) ip(T -t,u). 

Since M is a martingale, we have I(t) = for all t < T a.s. Letting t — > 0, by continuity of 
the parameters, we thus obtain 

d T 4>(T, u) + d T 4>(T, u) T x = ip(T, u) T b(x) + u) T a(x) 4>(T, u) 

for all x £ X, T > 0, u G iR d . Since tj}(0,u) = u, this implies that a and 6 are affine of the 
form (|2.3p . Plugging this back into the above equation and separating first order terms in x 
yields QTfy . 

Conversely, suppose a and b are of the form (|2.3p . Let (</>,tp) be a solution of the Riccati 
equations (|2.4[) such that (f>(t,u) + ip(t,u) T x has negative real part for all t > 0, u G iM d and 
x £ X. Then M, defined as above, is a uniformly bounded local martingale, and hence a 
martingale, with M(T) = e uT X{T) . Therefore E[M(T) | Tt] = M(t), for all t < T, which is 
(|2.2|) . and the theorem is proved. □ 

We now recall an important global existence, uniqueness and regularity result for the above 
Riccati equations. We let K be a placeholder for either R or C. 

Lemma 2.3. Let a and at be real d x d-matrices, and b and f3i be real d-vectors, 1 < i < d. 

(i) For every u G K , there exists some t+(u) € (0, oo] such that there exists a unique solu- 
tion (4>(-,u),ip(-,u)) : [0,t+(u)) — > K x K d of the Riccati equations (|2.4[) . In particular, 
i+(0) = oo. 

(ii) TTie domain 

V K = G R+ x A' d | f < t+(u)} 

is open in R+ x K and maximal in the sense that for all u G K d either t+ (u) — oo or 
lim tTt+{ll) ||^(t,u)|| = oo, respectively, . 

(iii) For every t G R+, the t-section 

V K {t) = {u£K d | (t,u)eV K } 

is an open neighborhood of in K d . Moreover, T>k(0) = K d and T>K(ti) 5 Daif-i) for 
< ti < t 2 . 

(iv) </> and ip are analytic functions on T>k ■ 

(v) v R = v c n (R+ x R d ). 

Henceforth, we shall call T>k the maximal domain for equation (12.41) . 
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Proof. Since the right-hand side of (|2.4p is formed by analytic functions in ip on K d , part 
follows from the basic theorems for ordinary differential equations, e.g. [I] Theorem 7.4]. In 
particular, t+(0) = oo since (</>(•, 0), ip(-, 0)) = is the unique solution of (|2.4|l for u — 0. It 
is proved in Q] Theorems 7.6 and 8.3] that T>k is maximal and open, which is part (ii) This 
also implies that all t-sections Vnit) are open in K d 
consequence of the maximality property from part 



of part (iv) see [111 Theorem 10.8.2]. Part (v) is obvious. 



The inclusion 2?x(ti) 3 2?x(fa) is a 
For a proof 
□ 



Whence part (iii) follows 



We will provide in Section[B]below some substantial improvements of the properties stated 
in Lemma 12.31 for the canonical state space X introduced in the following section. 



3 Canonical State Space 



There is an implicit trade off between the parameters a, on, b, j3i in (|2.3p and the state space 
X: 

• a, on, b, j3i must be such that X does not leave the set X, and 

• a, an must be such that a + X^iLi x i a i is symmetric and positive semi-definite for all 
x G X. 

To gain further explicit insight into this interplay, we now and henceforth assume that the 
state space is of the following canonical form 



for some integers m, n > with m + n — d. 

Remark 3.1. This canonical state space covers most applications appearing in the finance 
literature. However, other choices for the state space of an affine process are possible: 

(i) For instance, the following example for d = 1 admits as state space any closed interval 
X C R containing 0: 

dX = -Xdt, X(0) = x€X. 

This degenerate diffusion process is affine, since e " x(T) = e uo <T t)x ( J ) for all t < T 
(" [12] . Section 12). In general, affine diffusion processes on compact state spaces have to 
be degenerate. 

(ii) Matrix state-spaces Sj~ (d > 2), the cone of symmetric positive definite matrices (see 

[51 ED ED EI Ell ■ 

(iii) Parabolic state-spaces, cf. [18], which are in turn, related to quadratic processes on the 
canonical state-space ([7|, see also their Example 5.3, section 5) 

For the above canonical state space, we can give necessary and sufficient admissibility 
conditions on the parameters. The following terminology will be useful in the sequel. We 
define the index sets 

J = {l,...,m} and J = {m + 1, . . . , m + n}. 
For any vector /j, and matrix v, and index sets M, N, we denote by 

y-M = (^iJiEM, VMN = (Vij )ig M, jgJV 

the respective sub-vector and -matrix. 

Theorem 3.2. The process X on the canonical state space R™ x K" is affine if and only if 
a(x) and b(x) are affine of the form (|2.3|l for parameters a, on, b, f3i which are admissible in 
the following sense: 

a, cti are symmetric positive semi-definite, 
an = (and thus au = ajj = 0), 
ctj — for all j € J 
ce t ,ki = OLi.i k = for k€ I \ {i}, for all 1 < i,l < d, (3.1) 
beR+ x R n , 
Bu = 0, 

Bu has positive off-diagonal elements. 
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(3.2) 



In this case, the corresponding system of Riccati equations ()2.4[1 simplifies to 

dt<t>{t,u) = ^j(t,u) T ajj^j(t,u) +b T ^(t,u) 
4>(0,u) = 

d t ipi{t,u) = -V>(t,u) T OLiij}{t,u) + pjip(t,u), i £ I, 

d t ipj{t,u) = Bjjipj(t,u), 
ip(0, u) = u, 

and there exists a unique global solution (<j>(-,u),ij)(-,u)) : R+ — ► C_ X C™ X iR™ for all initial 
values u £ C™ x iR n . In particular, the equation for ipj forms an autonomous linear system 
with unique global solution ipj(t,u) — e Bjjt uj for all uj £ C n . 

Before we prove the theorem, let us illustrate the admissibility conditions (|3.ip for the 
diffusion matrix a(x) for dimension d = 3 and the corresponding cases m = 0, 1, 2, 3. Note 
that a(x) = a + X^^Li x i a i, hence in the case m = we have 

a(x) = a 

for an arbitrary positive semi-definite symmetric 3 x 3- matrix a. For m = 1, we have 

/ + * * \ 
ai = + * , 



for m 






ai = U U , Q2 = 




and for m — 3, 



/ 

o = 0, ai = I I, a 2 =| + I, os= 





where we leave the lower triangle of symmetric matrices blank, + denotes a non-negative real 
number and * any real number such that positive semi-definiteness holds. 

Proof of Theorem \3.S\ Suppose X is affine. That a(x) and b(x) are of the form (|2.3|) follows 
from Theorem 12.21 Obviously, a(x) is symmetric positive semi-definite for all x £ R+ x R n 
if and only if ctj = for all j £ J, and a and ccj are symmetric positive semi-definite for all 
i £ I. 

We extend the diffusion matrix and drift continuously to R d by setting 
a(x) = a + xfai and b(x) = b + xf/3i + Xj(3j . 

Now let a; be a boundary point of R™ x K™. That is, Xk = for some k £ I. The stochastic 
invariance Lemma IB . f I below implies that the diffusion must be "parallel to the boundary", 



e. 



and the drift must be "inward pointing" 



5 



Since this has to hold for all Xi > 0, i G / \ {k}, and Xj G R, j £ J, we obtain the following 
set of admissibility conditions 

a, a.i are symmetric positive semi-definite, 

a ek = for all k G I, 

oti ek = for all i 6 J \ {A:}, for all fc G /, 

Qj = for all j G J, 

& G R+ x R", 

A T efc > for all i G I \ {k}, for all k G J, 

/j/efc = for all j G J, for all fc G /, 

which is equivalent to (|3.1|l . The form of the system (|3.2p follows by inspection. 

Now suppose a, ai, 6, /3i satisfy the admissibility conditions (|3.ip . We show below that 
there exists a unique global solution (<f>(-,u),ip(-,u)) : R+ — > C_ x CH 1 x iR n of (|3.2|) . for all 
u G C" 1 x iR". In particular, 4>(t,u) + ip(t,u) T x has negative real part for all t > 0, u G iR d 
and a; G R+ x R n . Thus the first part of the theorem follows from Theorem |2~21 

As for the global existence and uniqueness statement, in view of Lemma [2, 3 1 it remains to 
show that ip(t,u) is C m x iR"-valued and t+(u) = oo for all u G CI 1 x iR n . For i G /, denote 
the right-hand side of the equation for ipi by 

Ri{ u ) = -u T ctiU + flju, 

and observe that 

Let us denote xf — (xf, . . . , ) T . Since $lipj(t,u) — 0, it follows from the admissibility 
conditions (|3.I[1 and Corollary IB. 21 below, setting f(t) = —^Stip{t,u), 

b t {t,x) = -^a Mi (a;^) 2 + ^^(t,u) T a t Qip(t,u) + pjjxj , i G /, 

and bj(t,x) = for j G J, that the solution ip(t,u) of (|3.2|) has to take values in C™ x iR n 
for all initial points u G C" 1 x iR" . 

Further, for i G / and m G C d , one verifies that 

#l(uiRi(u)) = -a;,ii|iti| 2 5ftui + 5R(Tt7 ?iiai,ij uj) + -^(Tid uj cti^jj uj) + #t(uiflju) 

<§(i + ll(^/) + ll + IMI 2 ) (i + IKII 2 ) 

for some finite constant K which does not depend on u. We thus obtain 
<9 t ||V/(^)|| 2 = 2K (^M T Ri (V/(t,w),e s ^ f ttj)) 

for 

= (f + + He 8 -' uj\\ 2 ) . 

Gronwall's inequality ( |III (10. 5. 1. 3)]), applied to (I + \\ipi (t, u) || 2 ), yields 

Ui{t,u)\\ 2 < ||iz/|| 2 + A' (1 + llii/H 2 ) f g(s)e K ^ 3 ^ dS; ds. (3.3) 

Jo 

From above, for all initial points u G C™ x iR", we know that (?H'tpi(t,u)) + = and 

therefore t+(u) = oo by ()3.3|l . Hence the theorem is proved. □ 

Now suppose X is afiine with characteristics (|2.3p satisfying the admissibility conditions 
(|3.I p . In what follows we show that not only can the functions <j)(t, u) and ip{t, it) be extended 
beyond u G iR d , but also the validity of the afiine transform formula (|2.2[) carries over. This 



G 



asserts exponential moments of X(t) in particular and will prove most useful for deriving 
pricing formulas in affine factor models. 

For any set U C R fc (k G N), we define the strip 



5(17) = Iz G C k | G U ] 



in C fe . The proof of the following theorem builds on results that are derived in Sections lAl 
and [Bl below. 



Theorem 3.3. Suppose X is affine with admissible parameters as given in (|3.1[1 . Let r > 0. 

Then 

(i) S(V r (t)) C V c (r) 
(ii) T>r(t) — M(t) where 



M{t) = ju G R d | E [e" Tx " (T) ] < oo for all x G 



(iii) Dr(t) and Ok are convex sets. 
Moreover, for all < t < T and x G R+ x R" , 

(iv) (|22J| holds for all u G <S (£>r(T - t)) 

(v) ([22]) ZioZds /or aZZ ueC ra x iR n 

(vi) M(t) D M{T). 

Proof. We first claim that, for every u G C d with t+(u) < oo, there exists some i £ I and 
some sequence t„ | such that 

lim(K^(t n ,^)) + = C5G. (3.4) 

Indeed, otherwise we would have sup tg [ t \\ (3?^j(t, w)) + 1 < oo. But then (|3.3[) would 
imply sup tg [ t+ („)) ||^7(t,w)|| < oo, which is absurd. Whence (|3.4[) is proved. 
In the following, we write 

G(u,t,x) = E [c" Txa:(t, ] , V(t,x) = |u G R d | G(u,t,x) < ooj . 
Since X is afhne, by definition we have R+ x iR d C X>c and H2.2|) implies 

G(«,t,aO = e ^( t '"'+*( t -") T:!: (3.5) 

for all u G iR d , t G M+ and x G R+ x R n . Moreover, by Lemma IB~5l. T>n{t) = 23 c (<) n R d is 
open and star-shaped around in R d . Hence Lemma lA.31 implies that T>m(t) C V(t,x) and 
holds for all it G £> c (f) n <S(2? R (t)), for all x G R+ x 1™ and t G [0, r]. 
Now let u G Cr(t) and v G R d , and define 

6* = inf{6» G R+ | u + i6v £ V c (t)}. 

We claim that 9* — oo. Arguing by contradiction, assume that 6* < oo. Since T>c{t) is open, 
this implies u + \9*v £ T>c{t), and thus 

t+(u + i6*v) < t. (3.6) 

On the other hand, since 2?r(t) is open, (1 + e)it G 2?r(t) for some e > 0. Hence (|3.5|) holds 
and G(t, (1 + e)u,x) is uniformly bounded in t G [0, t], by continuity of <f>(t, (1 + e)u) and 
4>(t, (1 + e)u) in t. We infer that the class of random variables {e ("+ ie *«) T *U) | j G [o, r ]} is 
uniformly integrable, see [341 13.3]. Since X(t) is continuous in t, we conclude by Lebesgue's 
convergence theorem that G(t,u + i8*v,x) is continuous in i 6 [0, r], for all x G R+ x R™. 
But for all t < t+(u + i0*v) we have (t,u + \6*v) G T> c (t) n and thus ([33)) holds 

for all x G R+ x R n . In view of (13.411 . this contradicts (13.61) . Whence 0* = oo and thus 
u + iv G Vc(t). This proves [(ij 



2 For an alternative proof of the above, see remark lB.71 
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Applying the above arguments td3 E [e" Tx|T) | J" f ] = G(T - t,u,X(t)) with T = t + r 
"follows, since, by Theorem 1331 C" 1 x iR™ C S(V u (t)) for all t G R+. 

it follows that u G M(r). Conversely, 



yields (iv) 



Part ( 



As for (ii) we first let u G X>b(t). From part (iv 
let u G M(t), and define 9* = sup{# > 
Assume, by contradiction, that 9* < 1 
x* G R!L l x R n such that 



u G X>k(t)}. 
From Lemma IB. 51 , 



We have to show that 9* > 1. 
we know that there exists some 



lim 

Ate* 



(r, 0«) + V(t, 



(3.7) 



On the other hand, from part (iv) and Jensen's inequality, we obtain 

e Hr,eu)+i,(. T ,e^) T x' = G ( T]0Uj;E *) < g(t,u,x*) 6 < G(t,u,x*) < oo 



for all 9 < 9* . But this contradicts (13.7 



A/ (r) is convex, this also implies (iii) Finally, part (vi) follows from part 
Whence the theorem is proved. 



hence u G 2?m(t), and part (ii) is proved. Since 

and Lemma[2.3l 



□ 



Remark 3.4. Glasserman and Kim [16] proved the equality in Theorem 13.31 (ii) and the 
validity of the transform formula (|2.2p for all u in an open neighborhood of 2?k(T — t) in C d , 
under the additional assumption that B has strictly negative eigenvalues. That assumption, 
however, excludes the simple Heston stochastic volatility model in Section [6] below. 

Remark 3.5. In Keller-Ressel |27l Theorem 3.18 and Lemma 3.19] it is shown that 

M(r + e) CO R (r) 

for all e > 0, for a more general class of affine Markov processes X x . Obviously, in our 
framework, this is implied by parts [(ilj| and [(vi)] of Theorem 13.31 

Remark 3.6. The convexity property of the maximal domain stated in Theorem 13.3 



hi) 



represents a non-trivial result for ordinary differential equations. Only in the mid 1990s have 
corresponding convexity results been derived in the analysis literature, see Lakshmikantham 
et al. EH. 



4 Discounting and Pricing in Affine Models 

We let X be affine on the canonical state space R™ x R" with admissible parameters a, on, b, (3i 
as given in (|3.1[) . Since we are interested in pricing, and to avoid a change of measure, we 
interpret P = Q as risk-neutral measure in what follows. 
A short rate model of the form 

r(t) = c + 1 T X(t), (4.1) 

for some constant parameters c G R and 7 G R d , is called an affine short rate model. Special 
cases, for dimension d = 1, are the Vasicek and Cox-Ingersoll-Ross short rate models. We 
recall that an affine term structure model always induces an affine short rate model. 

Now consider a T-claim with payoff f(X(T)). Here / : R+ x R n — > R denotes a measurable 
payoff function, such that f(X(T)) meets the required integrability conditions 

E \e- J o r ^ ds \f(X(T))\^ < 00. 

Its arbitrage price at time t < T is then given by 

Tv{t) =E [e- / * Tr(s)ds /(A'(T)) I jF t ] . (4.2) 

A particular example is the T-bond with / = 1. Our aim is to derive an analytic, or at least 
numerically tractable, pricing formula for (|4.2|l . To this end we shall make use of a change 
of numeraire technique to price, e.g., Bond options and caplets. Denote the risk free bank 
account by B(t) := r(a)ds . For fixed T > it is easily observed that 



P(0,T)B(T) 

3 Here we use the Markov property of X, see |25l Theorem 5.4.20]. 



— . — . . > and E 
P(0,T)B(T) 
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hence we may introduce an equivalent probability measure Q T ~ Q on Fr by its Radon- 
Nikodym derivative 

dQ T 1 



dQ P(0,T)B(T)' 
} T is called the T-forward measure. Note that for t < T, 



= E[ 

Ft 



P(0,T)B(T) 



F, 



P W (4.3) 



P(Q,T)B(t) 



As a first step towards establishing useful pricing formulas, we derive a formula for the 
Ft-conditional characteristic function of X(T) under Q T , which up to normalization with 
E r e -A Tr W ds | Ft] equals, 

E j e -/frW<k e u T x(T) | .F t ] , it G iR d (4.4) 

(use equation (|4.3[) . 

Note that the following integrability condition (i) is satisfied in particular if r is uniformly 
bounded from below, that is, if 7 G R+ x {0}. 

Theorem 4.1. Let r > 0. The following statements are equivalent: 

(i) E [Wo 1 " '■(*)*'] < 00 for all x G R!? x R". 

(ii) There exists a unique solution ($(•, u), ^(-, 11)) : [0, r] — > C x C d 0/ 

dt$(t, it) = j(t, u) T ajj <f.,(t, u) + 6 T *(t, u) - c, 
*(0,«)=0, 

8t*i(*,«) = i*(t,u) T a I *(t,i t )+A T *(t,u)~ 7l , »6l, (4 ' 5) 

dti>j{t,u) =Bjj^j(t,u)-jj, 
*(0, u) = it 



/or it = 0. 

7n either case, there exists an open convex neighborhood U ofOin R d such that the system 
of Riccati equations \4-5\ admits a unique solution ($(•, u), it)) : [0, r] — > C X C d for all 
u G S(U), and (|4.4p allows the following afpne representation 

E | e -''* Tr(s) ' i!, e uTx(T) I Ft] = e * (T ~ t '" )+ * (T ~ t '" )Tjf(t) (4.6) 

for all u G 5(17), t < T < t + r and x G R+ x R". 

Proof. We first enlarge the state space and consider the real-valued process 
Y(t)=y + J* (c + 7 T A( S ))d s , y G R. 

A moment's reflection reveals that X ' = is an Rip x R n+1 -valued diffusion process 

with diffusion matrix a' + x i a' i and drift b' + B'x' where 

a \ , ( on 0\ y = ( b \ B , ( B 



/ ' \ J ' " \ c J ' " V7 T ° 

form admissible parameters. We claim that X' is an afiine process. 
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Indeed, the candidate system of Riccati equations reads 



d t (p'(t,u,v 
</>'(0, u, v 
dtip'i(t, u, v 
d t ip'j(t,u,v 
d t ip'd+i(t,u,v 

tp'(0, u, v 



^ip'j(t,u,v) T ajjip'j(t,u,v) + b T ^' {1< ^ id} (t,u,v) + [cv], 
0, 

^i>'(t,u,v) T ai^'(t,u, v) + /3jip'(t,u, v) + | jjv | , i G I, 
Bjji>'j(t,u,v) + 



7ju 



(4.7) 



Here we replaced the constant solution 4>d+i{-,u, v) = v by v in the boxes. Theorem l3.2l carries 
over and asserts a unique global C_ x C" 1 x iR™ +1 -valued solution (<t>'(-,u,v),ip'(-,u,v)) of 
(|4T5|) for all (u,v) G C m x iR™ x iR. The second part of Theorem ROl thus asserts that X' is 
affinc with conditional characteristic function 



E e 



u ' X(T)+vY(T) 



<j>' (T-t,u,v)+i/i'(T-t,u,v) T X(t)+vY(t) 



for all (u, v) eC ra x iR™ x iR and t < T. 

The theorem now follows from Theorem l3.3l once we set <I>(t, u) — <j>'(t, u, —1) and ^(t, u) = 



Suppose, for the rest of this section, that either condition (i) or (ii) of Theorem 14. H is met. 
As immediate consequence of Theorem 14. II we obtain the following explicit price formulas for 
T-bonds in terms of $ and ty. 

Corollary 4.2. For any maturity T < t, the T-bond price at t < T is given as 



P(t,T)=e 



-A(T-t)-B(T-t) ' X(t) 



where we denote 

A(t) = -*(t,0), B(t) = -*(«, 0). 

Moreover, for t < T < S < r , the Ft -conditional characteristic function of X(T) under 
the S-forward measure Q s is given by 



r u 1 



-A(S-T) + <S>(T-t,u-B(S-T))+V{T-t,u-B(S-T)) 1 X(t) 



P(t,S) 

for all u G S(U + B(S — T)), where U is the neighborhood of in R d from Theorem\4-l 



(4.8) 



Proof. The bond price formula follows from (|4.6[) with u — 0. 

Now let t < T < S < t and u G 5(17 + B{S - T)). We obtain from (gSJ) by nested 
conditional expectation 



- ff r(s)ds u T X(T) 



T t ] =E[e-^'' (s)ds E[e-^ rWds |^t]c" Tx(t) |^ t ] 

E |e _ r(s)ds e (u-B(S-T)) T X(T) | 



e A(S-T) 

^^(T-t^-BiS-T^ + ViT-t.u-BiS-T)^ X(t) 



e A(S — T) 

Normalizing by P(t,S) yields (|4.8[1 . 

For more general payoff functions /, we can proceed as follows. 



□ 
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• Either we recognize the ^-conditional distribution, say q(t,T,dx), of X(T) under the 
T-forward measure from its characteristic function (|4.8|) . Or we derive q(t,T,dx) via 
numerical inversion of the characteristic function (|4.8[) . using e.g. fast Fourier transform 
(FFT). Then compute the price (|4.2[1 by integration of / 

n(t) = P(t,T) [ f(x)q(t,T,dx). (4.9) 

Examples are given in Section below. 

• Or suppose / can be expressed by 

/(*)= f e^ T *f(y)dy (4.10) 



for some integrable function / : R d — > C and some constant u £ U. Then we may apply 
Fubini's theorem to change the order of integration, which gives 



ir(t) = E 



Jv. d 

f E L-tf^ts^u+^xm | j {y)dy (4 n) 



} <Z(T-t,u+iy)+9(T-t,u+iy) T X(t) dy 

This integral can be numerically computed. An example is given in Section [5] below. 

The function / in (|4,10[1 can be found by Fourier transformation, as the following classical 
result indicates. 

Lemma 4.3. Let f : R d — » C be a measurable function and a£l be such that the function 
h(x) = e~ u x f(x) and its Fourier transform 

h(v) — / h(x)e~ lv x dx 

are integrable on M d . Then (|4.10|l holds for almost all x £ R d for 

Moreover, the right hand side of (|4.10|) is continuous in x. Hence, if f is continuous then 
flUTQll holds for all 

Proof. From Fourier analysis, see [331 Chapter I, Corollary 1.21], we know that 

h{x) = ^rL/ vTxHy)dv 

for almost all x £ R d . Multiplying both sides with e" x yields the first claim. 

From the Riemann Lcbcsguc Theorem ( 33, Chapter I, Theorem 1.2]) we know that the 
right hand side of (|4.10|l is continuous in a;. □ 

An example is the continuous payoff function 

fix) = (e"- K )+ 

of a European call option with strike price K on the underlying stock price e L , where L may 
be any affine function of X. Fix a real constant p > 1. Then h(x) — e~ px f(x) is integrable 
on R. An easy calculation shows that its Fourier transform 



Jr 



: dx ■ 



(p + iy)(p + iy - 1) 
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is also integrable on R. In view of Lemma 14.31 we thus conclude that, for p > 1, 

'■'-^-^•"" Vw^-D *- ( " 2) 

which is of the desired form (|4.10[) . We will apply this for the Heston stochastic volatility 
model in Section [6] below. 

A related example is the following 

if. , Tfi—p—iv 

( e » _ x)+ _ e " = — / e (p+la)a: 7 % T dy, (4.13) 

V ' 2^7 M (p + iy)(p + iy-l) V ; 

which holds for all < p < 1. 

More examples of payoff functions with integral representation, including the above, can 
be found in [21] , 

5 Bond Option Pricing in Affine Models 



We can further simplify formula (|4.9[) for a European call option on a S'-bond with expiry 
date T < S and strike price A. The payoff function is 

f(x)= (e-MS-T^Bts-T^x _ K 

We can decompose (|4.2|l . 

7r c (t;T,S) = P(t,S)Q s [E I Tt] - KP(t,T)Q T [E | T t \ (5.1) 

for the event E = {B(S - T) T X{T) < -A{S - T) - log A'}. The pricing of this bond option 
boils down to the computation of the probability of the event E under the S- and T-forward 
measures. 

Similarly, the value of a put equals 

n P (t;T,S) = KP(t,T)Q T [E c \ T t \ - P(t,S)Q S [E c | T t \ (5.2) 

for the event E c = ft \ E = {B(S - T) T X(T) > -A{S — T) — log A}. 

In the following two subsections, we illustrate this approach for the Vasicek and Cox- 
Ingersoll-Ross short rate models. 

5.1 Example: Vasicek Short Rate Model 

The state space is R, and we set r = X for the Vasicek short rate model 

dr = {b + pr) dt + a dW. 



The system (|4.5[) reads 

®(t, u ) = t;° 2 [ ^ 2 {s,u)ds + b f $>(s,u)ds 
2 Jo Jo 

d t V(t,u) =0V{t,u)- 1, 
*(0, u) = u 

which admits a unique global solution with 

e 0t _ l 

*(t,u) = e pt u — 

P 



J3t_ l e f>i 
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for all u £ C. Hence (|4.6|l holds for all u £ C and t <T. In particular, by Corollary 14,21 the 
bond prices P(t,T) can be determined by A and B, 

e 0t - 1 

B(t) = -*(*, o) = 2_ r -^ ) 

A(t) = -*(t, 0) = - 4e^ + 2/3t + 3) + ft 6 " ~ * ~ ^ 

Hence, under the S-forward measure, r(T) is ^-conditionally Gaussian distributed with 
(cf. g], chapter 3.2.1) 



E QS [r(T) | Tt] = r(t)e-^ T - s) +M s (t,T), 
Var s (r(T) \ T t ) = (J 2 



2e 2/3(T-t) _ j 



2/3 

where M s is defined by 

M S (t,T) = (| - ^)(1 - e-« r -") + [e-« s " T ) - >] . 

The bond option price formula for the Vasicek short rate model can now be derived via (|5.ip 
and (15.21). 



5.2 Example: Cox— Ingersoll— Ross Short Rate Model 

The state space is R + , and we set r = X for the Cox-Ingersoll-Ross short rate model 

dr = (b + /3r) dt + (Ty/f dW. 



The system (J4T5J) reads 



<&(*,«) = b [ \V(s,u)ds, 
Jo 

8t*(t,«) = ia 2 * 2 (i,«) - 1, (5 ' 3) 

*(0,it) = u. 

By Lemma 15.21 below, there exists a unique solution ($(-,u), : R+ — > C_ x C_, and 

thus (|4.6p holds, for all u £ C_ and t < T. The solution is given explicitly as 

u) = -j log ' 



*(t,u) = 



cr 2 \L 3 (t) -L 4 (t)« 
'L 3 (*)-i4(t)« 



where A = \//3 2 + 2a 2 and 

Li(i) = 2(e At -l) 

L 2 (t) = A(e Ai + l) + /?(e At -l) 

L 3 (t) = A(e At + l) -,3(e At -l) 

L 4 (t) = cr 2 (e At - l) 

L s (t) = 2\e^~. 

Some tedious but elementary algebraic manipulations show that the Tt- conditional char- 
acteristic function of r(T) under the 5*-forward measure Q is given by 



ur(T) , r 

is e LF f 



-ft(t,r,8)r(t)+ gg^ 

(i-C7i(t,r, s) u )5 
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where 



Ci(t,T,S) 



L 3 (S-T)L 4 (T-t) 
2XL 3 (S-t) 



C 2 (t,T,S) = 



L 2 (J-t) L^S-t) 
U(T-t) L 3 (S-t) 



Comparing this with Lemma 1 5. II below, we conclude that the jT t -conditional distribution 
of the random variable 2r(T)/Ci(t,T, S) under the S-forward measure Q s is noncentral \ 2 
with -4| degrees of freedom and parameter of noncentrality 2C2(t,T, S)r(t). Combining this 
with (|5.1[) - (|5.2[) . we obtain explicit European bond option price formulas. 

As an application, we now compute cap prices. Let us consider a cap with strike rate k 
and tenor structure 1/4 = T < Ti < • • • < T n , with T - T_i = 1/4. Here, as usual, T 
denote the settlement dates and T;_i the reset dates for the ith caplet, i — 1, . . . , n and T n is 
the maturity of the cap. It is well known that the cash flow of a ith caplet at time T, equals 
the (1 + k/4) multiple of the cash-flow at Ti_i of a put option on the Ti-bond with strike 
price 1/(1 + k/4). Hence the cap price equals 



Cp = J2 C P l d) = (1 + «/4) £ P(0, Ti-i)!^., 



1 + k/4 



- P(Ti. 



In practice, cap prices are often quoted in Black implied volatilities. By definition, the 
implied volatility ob > is the number, which, plugged into Black's formula, yields the cap 
value Cp = XlILi where the ith caplet price is given as 



Cpl{i) 



i P(0,T 1 )(F(T l _ 1 ,T i )$(diW) 



«*(*(»))) 



with 



where F(Ti- 



,TA =4 



log 



F{T i _ 1 ,T i ) 



'-(Ti-i-t) 



&ByTi~i — t 
denotes the corresponding simple forward rate. 



As parameters for the CIR model we assume 

a 2 = 0.033, b = 0.08, /3 = -0.9, r = 0.08. 
In Table [T] we summarize the AT\fl cap prices and implied volatilities for various maturities. 



Table 1: ATM cap prices for the CIR model 



Maturity Years 


strike rate 


cap price 


implied volatility 


1 


0.0843 


0.0073 


0.4506 


2 


0.0855 


0.0190 


0.3720 


3 


0.0862 


0.0302 


0.3226 


4 


0.0866 


0.0406 


0.2890 


5 


0.0868 


0.0501 


0.2647 


6 


0.0870 


0.0588 


0.2462 


7 


0.0871 


0.0668 


0.2316 


8 


0.0872 


0.0742 


0.2198 


9 


0.0873 


0.0809 


0.2100 


10 


0.0873 


0.0871 


0.2017 


15 


0.0875 


0.1110 


0.1744 


20 


0.0876 


0.1265 


0.1594 


25 


0.0876 


0.1365 


0.1502 


30 


0.0876 


0.1430 


0.1442 



4 The cap with maturity T n is at-the-money (ATM) if its strike rate re equals the prevailing forward swap rate 
4(P(0, To) - P(0, T„))/ £7=i P(0, Ti). 
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Lemma 5.1 (Noncentral x 2 -Distribution). The noncentral \ 2 -distribution with 5 > degrees 
of freedom and noncentrality parameter C, > has density function 

1 _£±C / X \ 4 2 

/xWjM = o e 2 7 ^i_i(vCa;), x > 



2 

anci characteristic function 



I 

J 7. 



e 

(1 - 2u)2 



Here I u (x) = X^>o j\r(j+v+i) (f ) 2j ' +I> denotes the modified Bessel function of the first kind 
of order v > — 1. 

Proof. See e.g. [23]. □ 

Lemma 5.2. Consider the Riccati differential equation 

dtG — AG 2 + BG — C, G(0,«)=«, (5.4) 

where A, B, C £ C anci it 6 C, iuif/i A / anci B 2 + 4AC £ C\R_ . Let y^" denote the analytic 
extension of the real square root to C \ R_, and define A = \J B" 1 + 4AC. 

(i) 27te function 

G(tu] 2C( e *-l)-(A(e*' + l)+i?(e*'-l)) M 

^.w; A(e** + l)-.B(e**-l)-2A(e**-l)u 1 j 

is </ie unique solution of equation (15. 4p on its maximal interval of existence [0, £+(«)). 
Moreover. 



/' 



G( S , u)ds = i log ( A(eAt + 1) _ B ff!_ 2 1) _ 24(eAi _ 1) , I 



(ii) //, moreover, A > 0, S € R, 5R(C) > anci it £ C_ t/ien i+(tt) = oo and G(t,u) is 
C_ -valued. 



= l/21og(z) 



Proof, (i) Recall that the square root 
of the real square root to C \ R_ , through the main branch of the logarithm which can be 
written in the form log(z) = Jj j Hence we may write (|5.4|l as 

G — A(G — \+)(G — A_), G(0,w) = u, 

where A± = B± ^2A +4AC , and it follows that 

A+(w- A_) - X-(u- \+)e xt 



(it — A-) — (it — A + )c 



G(i,u) 

which can be seen to be equivalent to (J5T5J . As A+ A-, numerator and denominator cannot 
vanish at the same time t, and certainly not for t near zero. Hence, by the maximality of 
t+(u), (|5.5[1 is the solution of (|5.4[) for t £ [0, t+(it)). Finally, the integral (|5.6[1 is checked by 
differentiation. 



(ii) We show along the lines of the proof of Theorem 1 3. 2 1 that for this choice of coefficients 



global solutions exist for initial data it G C_ and stay in C_. To this end, write R(G) = 
AG 2 + BG — C, then 

Vl(R(G)) = A{$t(G)) 2 - A(<=s(G)) 2 + fiSR(G) - 3?(G) < A(5t(G)) 2 + BK(G) 

and since j4,B6lwe have that 5R(G(£, it)) < for all times t £ [0, t+(u)), see Corollary IB. 21 
below. Furthermore, we see that 5R(GR(G)) < (1 + |G| 2 )(|B| + \C\), hence d t \G{t,u)\ 2 < 
2(1 + \G(t,u)\ 2 )(\B\ + |G|). This implies, by Gronwall's inequality ([TTJ (10.5.1.3)]), that 
t+(u) — oo. Hence the lemma is proved. □ 
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6 Heston Stochastic Volatility Model 

This affine model, proposed by Heston [20] , generalizes the Black-Scholes model by assuming 
a stochastic volatility. 

Interest rates are assumed to be constant r(t) = r > 0, and there is one risky asset (stock) 
S — e X2 , where X — (Xl,X2) is the affine process with state space R+ x R and dynamics 

dXi = (k + nXt)dt + a\/2XldWt 

dX 2 = (r-Xi)dt+ V2X7 (pdWi + sfl - p 2 dW 2 ^ 



for some constant parameters k, a > 0, k € R, and some p € [— 1, 1]. In view of Remark 13.41 
we note that here 

k 
-1 



B = 



is singular, and hence cannot have strictly negative eigenvalues. 
The implied risk-neutral stock dynamics read 

dS = Srdt + SVZXldW 



for the Brownian motion W = pW\ + \]\ — p 2 W2 ■ We see that \J2X\ is the stochastic 
volatility of the price process S. They have possibly non-zero covariation 

d(S,Xi) = 2paSX ± dt. 

The corresponding system of Riccati equations (|3.2[) is equivalent to 

cj)(t,u) = k I ipi(s,u) ds + ru'zt 
Jo 

d t ipi(t,u) = a 2 tpi(t,u) + (2pau 2 + K)ipi(t, u) + v% — ui (6.1) 
V>i(0, U) = Ul 
1p2(t,u) = U 2 , 

which, in view of Lemma l5^2" (ii) admits an explicit global solution if Ui G C_ and < 5ftu2 < 1. 
In particular, for Ui = and by setting A = y/ (2pau2 + ft) 2 + 4cr 2 (u2 — the solution can 
be given explicitly as 

xl , k ( 2Xe ^ T ^h \ 

M «) = ~, log A(e^ + l)-(2pa M2 + K)(e^-l) + 



1pl(t,U) 



2(u 2 - u 2 2 ){e xt - 1) ( 6 - 2 ) 



A(e A * + 1) - (2po-u 2 + K)(e At - 1) 
ip 2 {t,u) = it 2 . 

Furthermore, for u = (0, 1), we obtain 

#t,0,l)=rt, 0, 1) = (0,1) T . 

Theorem 13.31 thus implies that S(T) has finite first moment, for any T £ R+, and 

E[e-'' T S(T) I T t \ = e- rT m[e x * m \ T t \ = e^ 7 V {T - t)+x ' w =e- rt S(t), 

for t < T, which is just the martingale property of S. 
We now want to compute the price 

7v(t) = e-^-^E [(5(T) - A') + I Ft] 

of a European call option on S(T) with maturity T and strike price K. Fix some p > 1 small 
enough with (0,p) € V R (T). Formula (|4~T2")| combined with (|4"7TTj) then yields 

/,\ 1 -r(T-t) 



R' 1 — p—ty 

■dy. (6.3) 



(p + iy)(p + iy 
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Alternatively, we may fix any < p < 1 and then, combining (|4,13|l with (|4,11|) . 



■K{t) = Sit) + 



2tt 



-r(T-t) 



(T-t,0,p+ly)+^ 1 (T-t,0,p+iy)X 1 (t) + ( P +ly)X 2 {t) 



K 1 - 



■ dy. (6.4) 



(p + iy)(p + iy ■ 

Since we have explicit expressions (|6.2|) for (f>(T — t, 0, p + iy) and ipi (T — t, 0, p + iy), we 
only need to compute the integral with respect to y in (|6.3p or (|6.4[1 numerically. We have 
carried out numeric experiments for European option prices using MATLAB. Fastest results 
were achieved for values p m 0.5 by using (|6.4[) whereas keeping a constant error level the 
runtime explodes at p — > 0, 1, which is due to the singularities of the integrand. Also, an 
evaluation of residua 

njt = 0,p = 1/2) -n(t = 0,p= 1/2 + e) 
ir(t = 0,p = l/2) 

for e £ [0, 1/2) U (1/2, 1] suggests that (|6.4|l is numerically more stable than (|6.3p . 

Next, we present implied volatilities obtained by (|6.4|l setting p — 1/2. As initial data for 
X and model parameters, we chose 

Ai(0) = 0.02, X 2 (0) = 0.00, a = 0.1, k = -2.0, k = 0.02, r = 0.01, p = 0.5. 

Table [2] shows implied volatilities from call option prices at t = for various strikes K and 
maturities T, computed with (|6.4p for p = 0.5. These values are in well accordance with MC 
simulations (mesh size T/500, number of sample paths = 10000). The corresponding implied 
volatility surface is shown in Figure [1] 



Table 2: Implied volatilities for the Heston model 



T-K 


0.8000 


0.9000 


1.0000 


1.1000 


1.2000 


0.5000 


0.1611 


0.1682 


0.1785 


0.1892 


0.1992 


1.0000 


0.1513 


0.1579 


0.1664 


0.1751 


0.1835 


1.5000 


0.1464 


0.1524 


0.1594 


0.1665 


0.1734 


2.0000 


0.1438 


0.1492 


0.1551 


0.1611 


0.1668 


2.5000 


0.1424 


0.1473 


0.1524 


0.1574 


0.1623 


3.0000 


0.1417 


0.1460 


0.1505 


0.1549 


0.1591 




Remark 6.1. We note that the Heston model is often written in the equivalent form 

dv = «(»7 — v)dt + asfv dWi 
dS = rSdt + Sy/vdW 
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To see the relation of the parameters of this form and the one used in this section, we simply 
set v = 2Xi, and then get 

dXi = (Rr] - Rv)dt + aV2Xl dWi Xi(0) = X 10 



dS 
S 



= rdt + V2XldW, 5(0) = e 



x 2 (0) 



from which we read off 

k = kij, k — —k, X10 = vo/2 
and all other parameters coincide. 



7 Affine Transformations and Canonical Represen- 
tation 

As above, we let X be affine on the canonical state space R™ x R n with admissible parameters 
a,ati,b,/3i. Hence, in view of 1)2.10 . for any x £ R+ x R n the process X = X x satisfies 

dX = {b + BX) dt + p(X) dW 
X(0)=x, (7,1) 

and p(x)p(x) T = a + J2iei Xia ^ 

It can easily be checked that for every invertible d x d-matrix A, the linear transform 

Y = AX 

satisfies 

dY = (Ab + ASA _1 y) dt + Ap (A~V) dW, Y(0) = Ax. (7.2) 
Hence, Y has again an affine drift and diffusion matrix 

Ab + ABA^y and Aa(A _1 y)A T , (7.3) 

respectively. 

On the other hand, the affine short rate model ()4.1)l can be expressed in terms of Y(t) as 

r(t) = c + 7 T A _1 F(t) . (7.4) 

This shows that Y and (|7.4)) specify an affine short rate model producing the same short 
rates, and thus bond prices, as X and ()4.1)) . That is, an invertible linear transformation of 
the state process changes the particular form of the stochastic differential equation (|7.ip . But 
it leaves observable quantities, such as short rates and bond prices invariant. 

This motivates the question whether there exists a classification method ensuring that 
affine short rate models with the same observable implications have a unique canonical rep- 
resentation. This topic has been addressed in [101 [9l [24l [8]. We now elaborate on this issue 
and show that the diffusion matrix a(x) can always be brought into block-diagonal form by 
a regular linear transform A with A(R^ x f") = x R". 

We denote by 

diag(zi, . . . , Zm) 

the diagonal matrix with diagonal elements z± , . . . , z m , and we write 7 m for the m x m-identity 
matrix. 

Lemma 7.1. There exists some invertible d x d-matrix A with A(R+ x R n ) = R™ x I" such 
that Aa(A~ 1 y)A T is block- diagonal of the form 



Aa(A- 1 y)A 1 



diag(yi, . . . ,y q ,0, . . . , 0) 

P + T.ieiVi^ 



for some integer < q < m and symmetric positive semi-definite nxn matrices p, 7Tt , . . . , n m . 
Moreover, Ab and ABA -1 meet the respective admissibility conditions (|3.ip in lieu of b and 
B. 
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Proof. From (|2.3|l we know that Aa(x)A T is block-diagonal for all x — h~ 1 y if and only if 
AaA T and A«iA T are block-diagonal for all i € I. By permutation and scaling of the first 
m coordinate axes (this is a linear bijection from R+ x K" onto itself, which preserves the 
admissibility of the transformed b and B), we may assume that there exists some integer 
< q < m such that 0:1,11 = • • • = a q , qq = 1 and oa^u = for q < i < m. Hence a and at for 
q < i < m are already block-diagonal of the special form 





' 







< 





-1 


v 


ajj J 


, <*i = { 


v 


cti,jj 



For 1 < i < q, we may have non-zero off-diagonal elements in the i-th row on,ij. We thus 
define the n x m-matrix D — (Si, . . . , 8 m ) with i-th column Si = —Ui.ij and set 



A = 



Im 
D /„ 



One checks by inspection that D is invertible and maps R+ x R" onto R™ x R n . Moreover, 

Dai t u = —a.i 7 ji, i £ I. 

From here we easily verify that 



Acti 

and thus 



AaiA = 



OtiJI Otijj 

Daijj + ai,.jj 



on,ii 
Dotijj + oiiij 



Since AaA T = a, the first assertion is proved. 

The admissibility conditions for Afe and ABA -1 can easily be checked as well. □ 



In view of (|7,3[1 . (JTT4J) an d Lemma 17. II we thus obtain the following result. 

Theorem 7.2 (Canonical Representation). Any affine short rate model (|4.1[) . after some 
modification 0/7 if necessary, admits an R™ x R n -valued affine state process X with block- 
diagonal diffusion matrix of the form 

/ diag^i,.. .,x q ,0,...,0) \ 

V a + 22 ie i x i a i, JJ J 

for some integer < q < m. 



8 Existence and Uniqueness of Affine Processes 

All we said about the affine process X so far was under the premise that there exists a unique 
solution X = X x of the stochastic differential equation (|2.1|l on some appropriate state space 
X C R d . However, if the diffusion matrix p(x)p(x) T is affine then p(x) cannot be Lipschitz 
continuous in x in general. This raises the question whether (|2.I|I admits a solution at all. 

In this section, we show how X can always be realized as unique solution of the stochastic 
differential equation (|2.1[) , which is (|7.1[) , in the canonical affine framework X — W+ x R™ and 
for particular choices of p(x). 

We recall from Theorem 12.21 that the affine property of X imposes explicit conditions on 
p(x)p(x) T , but not on p(x) as such. Indeed, for any orthogonal d x d-matrix D, the function 
p(x)D yields the same diffusion matrix, p(x)DD T p(x) T = p(x)p(x) T , as p(x). 

On the other hand, from Theorem 13. 21 we know that any admissible parameters a, an, b, (3i 
in (|2.3[1 uniquely determine the functions (cj>(-,u),ij}(-,u)) : R+ — > C_ x C™ x iR™ as solution 
of the Riccati equations (|3,2[) , for all u £ C™ x iR" . These in turn uniquely determine the law 
of the process X. Indeed, for any < ti < t% and Ui, U2 €E C" 1 x iR", we infer by iteration of 
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= E ^e u i rx(tl) e'* (t2 ~ tl ' U2)+l/ ' (t2_tl '" 2)Tx(tl) j 

_ e <Kt2-ti,u 2 ) + cl>(t 1 ,u 1 +i>(t2-t 1 ,U2)) + lp(t 1 ,u 1 + lp(t2-t 1 ,U2)) T X 

Hence the joint distribution of (X(ti), Xfa)) is uniquely determined by the functions <f> and tp. 
By further iteration of this argument, we conclude that every finite dimensional distribution, 
and thus the law, of X is uniquely determined by the parameters a, on, b, Pi. 

We conclude that the law of an affine process X, while uniquely determined by its charac- 
teristics (|2.3p . can be realized by infinitely many variants of the stochastic differential equa- 
tion (|7.1[) by replacing p(x) by p(x)D, for any orthogonal d x d-matrix D. We now propose 
a canonical choice of p(x) as follows: 

• In view of (|7.2p and Lemma 17. II every affine process X on R™ x R™ can be written as 
X = h~ 1 Y for some invertible d x d-matrix A and some affine process Y on R+ x R" 
with block-diagonal diffusion matrix. It is thus enough to consider such p(x) where 
p(x)p(x) T is of the form (|7.5p . Obviously, p(x) = p(xi) is a function of xi only. 

• Set pu(x) = 0, p.ji(x) = 0, and 

pii(xi) = diag(yaTT, . . . , y/x^, 0, . . . , 0). 
Chose for pjj(xi) any measurable n x n-matrix-valued function satisfying 

pjj{xi)pj.j{x I ) T = a + 22 Xi°H,JJ- (8-1) 

In practice, one would determine pjj(xi) via Cholesky factorization, see e.g. [311 The- 
orem 2.2.5]. If a + Y^iei x i a i,JJ is strictly positive definite, then pjj{x{) turns out 
to be the unique lower triangular matrix with strictly positive diagonal elements and 
satisfying (|8.1[) . If a + X^igj x i° t i,.JJ ls merely positive semi-definite, then the algorithm 
becomes more involved. In any case, pjj(xi) will depend measurably on xi. 

• The stochastic differential equation (|7.1|l now reads 

dXi = (bi + BuXi) dt + pn(Xi) dWi 

dXj = (bj + BjiXi + BjjXj) dt + pjjiXj) dWj (8.2) 
X(0) = x 

Lemma 18.21 below asserts the existence and uniqueness of an R+ x R"-valued solution 
X = X x , for any x £ R™ x R™. 

We thus have shown: 

Theorem 8.1. Let a, on, b, Pi be admissible parameters. Then there exists a measurable func- 
tion p : R+ xR n -> R dxd with p(x)p(x) T = a+J2 ie i x i a i, and such that, for any x € K+ xR", 
there exists a unique R+ x R n -valued solution X — X x of (17.111 . 

Moreover, the law of X is uniquely determined by a,cti,b, Pi, and does not depend on the 
particular choice of p. 

The proof of the following lemma uses the concept of a weak solution. The interested 
reader will find detailed background in e.g. [251 Section 5.3]. 

Lemma 8.2. For any x G R™ x R", there exists a unique R™ x M. n -valued solution X = X x 
of (EH). 

Proof. First, we extend p continuously to R d by setting p(x) = p(xf , . . . , xti), where we 
denote xf = max(0,i,). 

Now observe that Xi solves the autonomous equation 

dXj = (b I +B n X I )dt + p I i(X I )dWi, X I {0) = x I . (8.3) 
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Obviously, there exists a finite constant K such that the linear growth condition 

116/ + B II x I \\ 2 + ||p(x/)|| a < K(l + \\xjf) 

is satisfied for all x G R m . By |22l Theorems 2.3 and 2.4] there exists a weak solutiorjf] 
of ()8.3|) . On the other hand, (|8.3[) is exactly of the form as assumed in [351 Theorem 1], 
which implies that pathwise uniquenes^f] holds for (|8.3p . The Yamada-Watanabe Theorem, 
see |35l Corollary 3] or [251 Corollary 5.3.23], thus implies that there exists a unique solution 
Xi = Xp of ((831), for all xi G R m . 

Given Xj 1 , it is then easily seen that 



Xj(t) = 




(b J +B JI X I (s))ds 



+ J e- B «'pjj(^j(»))<Wj(')) 

is the unique solution to the second equation in (|8.2[) . 

Admissibility of the parameters 6 and f3i and the stochastic invariance Lemma [BT] even- 
tually imply that Xi = Xj 7 is R^-valued for all xi G R+ . Whence the lemma is proved. □ 



A On the Regularity of Characteristic Functions 

This auxiliary section provides some analytic regularity results for characteristic functions, 
which are of independent interest. These results enter the main text only via the proof of 
Theorem 13.31 This section may thus be skipped at the first reading. 
Let v be a bounded measure on R d , and denote by 

G(z) = f e zTx u{dx) 

its characteristic function^ for z G iR d . Note that G(z) is actually well defined for z G S(V) 
where 

V = I y G R d | J e yT * v(dx) < ooj . 

We first investigate the interplay between the (marginal) moments of v and the corre- 
sponding (partial) regularity of G. 

Lemma A.l. Denote g(y) = G(iy) for y G R d , and let k G N and 1 < i < d. 
If d% g(0) exists then 

/ \xi\ 2k v(dx) < <x. 
Jm d 

On the other hand, if J Rd \\x\\ k v(dx) < oo then g G C k and 

d vn ■ ■ ■ 9 Vi g(y) = i x i% --- x h e ly x u{dx) 
for all y G R d , 1 < h, . . . ,ii < d and 1 < I < k. 

Proof. As usual, let e, denote the ith standard basis vector in R d . Observe that s i— > g(set) 
is the characteristic function of the image measure of v on R by the mapping x i— > n. Since 
dg g[sei)\s=o = dy g(0), the assertion follows from the one-dimensional case, see [301 Theorem 
2.3.1]. 

The second part of the lemma follows by differentiating under the integral sign, which is 
allowed by dominated convergence. □ 

5 A weak solution consists of a filtered probability space (fl, T, {^Ft), P) carrying a continuous adapted process 
Xj and a Brownian motion Wj such that 118.31 1 is satisfied. The crux of a weak solution is that Xj is not necessarily 
adapted to the filtration generated by the Brownian motion Wi. See 1351 Definition 1] or [25l Definition 5.3.1]. 

6 Pathwise uniqueness holds if, for any two weak solutions (Xj , Wj ) and (X j , Wj ) of 1 18. 31 1 defined on the the 
same probability space (O, T, P) with common Brownian motion Wi and with common initial value Xi(0) = X|(0), 
the two processes are indistinguishable: ¥[Xj(t) = X|(t) for all t > 0] = 1. See 1351 Definition 2] or 1251 Section 
5.3]. 

7 This is a slight abuse of terminology, since the characteristic function g(y) = G(iy) of v is usually defined on 
real arguments y £ M. d . However, it facilitates the subsequent notation. 
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Lemma A. 2. The set V is convex. Moreover, if U C V is an open set in M d , then G is 
analytic on the open strip S(U) in C d . 

Proof. Since G : R d -> [0, oo] is a convex function, its domain V = {y £ R d | G(y) < 00} is 
convex, and so is every level set Vi = {y £ R d | G(y) < 1} for I > 0. 

Now let U C V be an open set in R d . Since any convex function on R d is continuous on 
the open interior of its domain, see [321 Theorem 10.1], we infer that G is continuous on U. 
We may thus assume that Ui = {y £ R d | G(y) < 1} n U C Vi is open in R d and non-empty 
for I > large enough. 

Let z £ S(Ui) and (z„) be a sequence in S(Ui) with z n — > z. For n large enough, there 
exists some p > 1 such that pz n £ S(Ui). This implies pRez n £ Vi and hence 

/* I TIP 

/ e z " 1 v{dx) < I. 

Hence the class of functions {e z ™ 1 | n £ N} is uniformly integrable with respect to u, see [341 
13.3]. Since e z " x — > e z 31 for all x, we conclude by Lebesgue's convergence theorem that 

\G(z n ) - G{z)\ < f \e z ~ x -e zTx \ v(dx)^0. 

Hence G is continuous on S(Ui). 

It thus follows from the Cauchy formula, see [111 Section IX. 9], that G is analytic on S(Ui) 
if and only if, for every z £ S{Ui) and 1 < i < d, the function £ 1— > G(z + C e is analytic on 
{C £ C I z + (^Ci £ S(Ui)}. Here, as usual, we denote et the zth standard basis vector in R d . 

We thus let z £ SiUi) and 1 < i < d. Then there exists some e_ < < e+ such that z + 
Cei £ «S(f7i) for all £ £ 5([e_,e+]). In particular, \ e ( z +*-^ T x \ v (dx) and |e( z+e + e '> Ta; | 
are bounded measures on R d . By dominated convergence, it follows that the two summands 

G(z + ( ei )= [ e (c - £ - )a: 'e (z+e - e ' )T ^(dx) 

J {a:i<0} 

+ f e (c - e + )a; 'e (z+£ + <i ' )Ta; i/(dx), 

J { Xi >0} 

are complex differentiable, and thus G is analytic, in £ £ S((e_, e+)). Whence G is analytic 
on S(Ui). Since <S(C) = Ui>oS(Ui), the lemma follows. □ 

In general, V does not have an open interior in R d . The next lemma provides sufficient 
conditions for the existence of an open set U C V in R d . 

Lemma A. 3. Let U' be an open neighborhood ofO in C d and h an analytic function on U' . 
Suppose that U = [/'nl i is star-shaped around and G(z) — h(z) for all z £ U' niR d . Then 
U CV andG = h onU'n S(U). 

Proof. We first suppose that U' = P p for the open polydisc 

P P = [z £ C d I \zi\ <p u 1 < i < d| , 

for some p = (pi, . . . , pd) £ R++- Note the symmetry iP p = P p . 

As in Lemma lA.ll we denote g(y) = G(iy) for y £ R d - By assumption, g(y) = /i(ij/) for 
all y £ P p PI R d . Hence p is analytic on P p n R d , and the Cauchy formula, [111 Section IX. 9], 
yields 

g(y)= <*i,.~,i*v i i—v? forj/£P p nR d 

where ( gN Cjj^,...,^^ 1 ■ ■ ■ z l f = for all z £ P p . This power series is absolutely 

convergent on P p , that is, 

K>-,iJ ki 1 •••4" I < 00 for all z £ P p . 
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From the first part of Lemma lA.ll we infer that v possesses all moments, that is, J Rd \\x\\ k u(d: 
oo for all k £ N. From the second part of Lemma I A . 1 1 thus 



jtlH Md 



■ x, v\ax 



(da 



From the inequality \xif h 1 < (x\ k + x 2k 2 )/2, for k £ N, and the above properties, we 
infer that for all z £ P p , 



I 

JR" 



e T.Ll\*i\\*i\ v ( dx ) = " '"^1 / UH ...x^l^da;) < OO 



Hence P p n R d C V, and Lemma lA~2l implies that G is analytic on S{P P nR d ). Since the 
power series for G and /i coincide on P p niR d , we conclude that G = h on P p , and the lemma 
is proved for U' — P p . 

Now let U' be an open neighborhood of in C d . Then there exists some open polydisc 
P p C U' with p £ R++- By the preceding case, we have P p n R d C V and G = ft on P D . In 
view of Lemma lA.21 it thus remains to show that U — U' n R d C V. 

To this end, let a £ U . Since (7 is star-shaped around in R d , there exists some si > 1 
such that sa £ [/ for all s £ [0, si] and h(sa) is analytic in s £ (0,si). On the other hand, 
there exists some < so < si such that sa £ P p nR d for all s £ [0, so], and G(sa) = h(sa) for 
s £ (0, so). This implies 

/ e sa x v(dx) — h(sa) — / e sa x v(dx) 

J{a T x>0} J{a r x<0} 

for s £ (0, so). By Lemma I A. 21 the right hand side is an analytic function in s £ (0, si). 
We conclude by Lemma IA.4I below, for fi defined as the image measure of v on R+ by the 
mapping x i— > a T x, that a £ V. Hence the lemma is proved. □ 

Lemma A. 4. Let p be a bounded measure on R+, and h an analytic function on (0, si), such 
that 



I 

■J J-:-. 



e ax n{dx) = h{s) (A.l) 

for all s £ (0, so), for some numbers < so < si. Then (|A.1|) also holds for s £ (0, si). 
Proof. Denote /(s) = J R ^ e ax fi(dx) and define Soo = sup{s > | f(s) < oo} > so, such that 

f(s) = +oo for s > Soo. (A. 2) 

We assume, by contradiction, that Soo < si. Then there exists some s* £ (0, Soo) and e > 
such that s* < s^ < s* + e and such that h can be developed in an absolutely convergent 
power series 

h(s) = ^2 ^j(s — s,) h for s £ (s» — e, s* + e). 

fc>0 

In view of Lemma I A. 21 / is analytic, and thus / = h, on (0, Soo)- Hence we obtain, by 
dominated convergence, 



d k 

Ck= ds-*' 



d k f 
:h(s)\ a=s „ = — f(s)\ s =s, = J x k e s * x fi(dx) > 0. 

By monotone convergence, we conclude 

M s ) = 5l/ Tr( s _ s t ) k e a * x fi(dx) = f ^2^—(s-s t ) k e 3 * x p(dx) = f e 3X fi(dx) 



fc>0" K + fc>0 



for all s £ (s*,s* + e). But this contradicts (|A.2|) . Whence Soo > si, and the lemma is 
proved. □ 
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B Invariance and Comparison Results for Differen- 
tial Equations 

In this section we deliver invariance and comparison results for stochastic and ordinary dif- 
ferential equations, which are used in the proofs of the main Theorems 13.21 |3~!3"1 and B~T1 and 
Lemma 18.21 above. 

We start with an invariance result for the stochastic differential equation (|2.1[) . 

Lemma B.l. Suppose b and p in (|2.1[) admit a continuous and measurable extension to R d , 
respectively, and such that a is continuous on R d . Let u G R d \ {0} and define the half space 

H = {x G R d | u T x > 0}, 

its interior H° — {x G R d | u T x > 0}, and its boundary dH — {x G H \ u T x = 0}. 

(i) Fix x G dH and let X = X x be a solution of JUT]). If X(i) G H for all t > 0, then 
necessarily 

u T a(x)u = (B.l) 

u T b(x) > 0. (B.2) 

(ii) Conversely, if (|B.1|) and (|B.2|) hold for all x G R d \ H° , then any solution X of (|2.1|) 
with X(0) G H satisfies X(t) G H for all t>0. 



Intuitively speaking. (|B.1|I means that the diffusion must be "parallel to the boundary" 
and (|B.2[) says that the drift must be "inward pointing" at the boundary of H. 



Proof. Fix x G dH and let X = X x be a solution of (|2.1|) . Hence 

u T X(t) = / u T b(X(s))ds+ /" u T p(X(s))dW(s). 

Jo 

Since a and b are continuous, there exists a stopping time n > and a finite constant K such 
that 

| M T fe(X(t An))| < K 

and 

||ti T ppf(£ An))|| 2 = u T a(X{t An))« < if 
for all t > 0. In particular, the stochastic integral part of u T X(t A n) is a martingale. Hence 



E [u T X(tAn)] =E 



u T &(X(s))ds 



t > 0. 



We now argue by contradiction, and assume first that u T b(x) < 0. By continuity of b and 
X(t), there exists some e > and a stopping time T2 > such that u T 6(X(t)) < — e for all 
t < T2- In view of the above this implies 



E \u T X(r 2 An)] < 0. 



This contradicts X(t) G # for all t > 0, whence (|B~2|) holds. 

As for (|B.1|) . let C > be a finite constant and define the stochastic exponential Z t = 
£{—C Jq u T p(X) dW). Then Z is a strictly positive local martingale. Integration by parts 
yields 

u T X(t)Z(t) = J Z(s) (u T b(X(s))~Cu T a(X(s))uj ds + M(t) 

where M is a local martingale. Hence there exists a stopping time T3 > such that for all 
t > 0, 

E[n T I(iAT 3 )Z(iAT 3 )]=E J 3 Z{s)(u T b{X(s))-Cu T a(X{s))u^ds 
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Now assume that u a(x) uO. By continuity of a and X(t), there exists some e > and a 
stopping time T4 > such that u T a(X(t)) u > e for all t < T4. For C > K/e, this implies 



E ^it T X(r 4 A r 3 A Ti)Z(ta A r :i A n)j < 0. 



This contradicts X(t) £ for all t > 0. Hence (|B.1|) holds, and part (i) is proved. 

As for part I (ii) I suppose (fB~T) and pH]) hold for all x G R d \ H°, and let X be a solution 
of (|2.1|) with A'(O) G For 5, e > define the stopping time 

ra, e = inf tt \ u T X(t) < -e and u T X(s) < for all s € [t - . 
Then on {rg, e < 00} we have u T p(X(s)) = for Tg, e — 3 < s < Tg, e and thus 
> u T X(Tg, e ) - u T X{rg, t - S) = f S '' u T b(X(s)) ds > 0, 

a contradiction. Hence Tg^ = 00. Since 5, e > were arbitrary, we conclude that u T X (t) > 
for all i > 0, as desired. Whence the lemma is proved. □ 

It is straightforward to extend Lemma IB . 1 1 towards a polyhedral convex set P\ i=1 Hi with 
half-spaces Hi = {x G R d | uj x > 0}, for some elements zti, . . . , Mfc G R d \ {0} and some 
A; G N. This holds in particular for the canonical state space x R". Moreover, Lemma [B. II 
includes time-inhomogeneou^f] ordinary differential equations as special case. The proofs of 
the following two corollaries are left to the reader. 

Corollary B.2. Let Hi = {x G R d | Xi > 0} denote the i-th canonical half space in R d , for 
i = 1, . . . , m. Let b : R + x R d — > R d be a continuous map satisfying, for all t > 0, 

b(t, x) = 6(t, ari", . . . , x^, x m +i, ■ ■ ■ , Xd) for all x G R d , anrf 
6i(t, x) > for all x £ dHi, i — 1, . . . ,m. 

Then any solution f of 

dtf(t) = b(t,f(t)) 
with /(0) G R+ x R" satisfies f(t) G R+ x R n for all t > 0. 

Corollary B.3. Let B(t) andC(t) be continuous R mxm _ and -valued parameters, respec- 
tively, such that Bij(t) > whenever i 7^ j. Then the solution f of the linear differential 
equation in R m 

d t f(t) = B(t)f(t)+C(t) 
with /(0) G R+ satisfies f(t) G R+ for all t > 0. 

Here and subsequently, we let >z denote the partial order on R m induced by the cone R™. 
That is, x >z y if x — y G R+. Then Corollary IB. 31 may be rephrased, for C(t) = 0, by saying 
that the operator e ^ B ( 3 ) da is ^-order preserving, i.e. e ^ s(s)ds Rip C R^. 

Next, we consider time-inhomogeneous Riccati equations in R m of the special form 

d t fi(t) = A l fi(t) 2 + Bjf(t)+C l (t), i = l,...,m, (B.3) 

for some parameters A, B,C(t) satisfying the following admissibility conditions 

A = (Ai, . . . , A m ) G R m , 
B tJ > for 1 < i ^ j < m, (B.4) 
C(t) — (C'i(t), . . . , C m {t)) continuous R m -valued. 



The following lemma provides a comparison result for (|B.3|I . It shows, in particular, that 
the solution of ()B.3|I is uniformly bounded from below on compacts with respect to y if A >: 0. 



'Time-inhomogcncous differential equations can be made homogeneous by enlarging the state space. 
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Lemma B.4. Let A^- k ',B,C^ k \ k — 1,2, be parameters satisfying the admissibility condi- 
tions (|B.4|) . and 

A (1) <A {2 \ C w {t) <C {2) {t). (B.5) 

Let r > and / (fc) : [0, r) — > R m fee solutions of (|B.4[) witfl ,4 and (7 replaced by A {k) and 
C (fc) , respectively, k = 1,2. J// (1) (0) ^ / (2) (0) i/ien ^ / (2) (t) /or a// i G [0,r). //, 

moreover, A^ = t/ien 

c Bt (/ (1) (0) + J\- Bs C (1 \s) ds^j * f 2 \t) 

for all t G [0, r). 

Proof. The function / = / (2) - solves 

W) = 4 2) (/P°(*)) a - (/i 1 '^)) 2 + s7/ + cf)(t) - cf >(t) 

= (4 2) - (/f ) W) 2 + 4 X) (/f a) W + /*(*) + + cf\t) - cf \t) 

= ^(f) T /(t)+c;(t), 

where we write 

B i {t) = B i + A? ) (/fW+Zf'w) 

gfc) = (Af - A?) (/<*>(*))' + Cf (t) - Cf'(t). 

Note that _B = (Bij) and C satisfy the assumptions of Corollary IB, 3l in lieu of B and C, and 
/(0) G R+. Hence Corollary lB~3l implies f(t) G R+ for all t G [0,r), as desired. The last 
statement of the lemma follows by the variation of constants formula for f"'(t). □ 

After these preliminary comparison results for the Riccati equation ()B.3|) . we now can 
state and prove an important result for the system of Riccati equations (|3.2p . The following 
is an essential ingredient of the proof of Theorem 13.31 It is inspired by the line of arguments 
in Glasserman and Kim |16| . 

Lemma B.5. Let T>m denote the maximal domain for the system of Riccati equations (|3.2I) . 
Let (t, m) G X>k . Then 

(i) 2?r(t) is star-shaped around zero. 

(ii) 6* = sup{6> > | 6u G Dr(t)} satisfies either 6* = oo or limefe* \\ifii (t, 6u) \\ — oo. In the 
latter case, there exists some x* G R+ xR" such that lim#f£i» <j>{r, 9u)+ip( T , 6u) T x* = oo. 

Proof. We first assume that the matrices on are block-diagonal, such that on,ij = 0, for all 
i = 1, . . . , m. 

Fix 9 G (0, 1]. We claim that Ou G 2>r(t). It follows by inspection that f w (t) = tlM^H 
solves (|B.3|) with 

Af ] = iflai,«, B = Bjj, Cf>(t) = «) + \^>j{t, u) T e ai ,jjiPj{t, it), 

and /(0) = u. Lemma IB .41 thus implies that f^ e \t) is nice behaved, as 

e s "' (u + j\- B " s C i0 \s)ds^j < f ie) {t) < ipi(t,u), (B.6) 

for all t G [0,t+(8u)) (~l [0,r]. By the maximality of "Dm. we conclude that r < t+(0u), which 
implies 6u G X>r(t), as desired. Hence X>r(t) is star-shaped around zero, which is part (i) 

Next suppose that 0* < oo. Since 2?k(t) is open, this implies 8*u ^ 2?k(i") an d thus 
t+{6*u) < t. From part [(i)] we know that (t,9u) G Ok for all t < t+(6*u) and < 6 < 6*. 
On the other hand, there exists a sequence t„ "f t+(9*u) such that |]i/>_r(t n , > n for all 

n G N. By continuity of t/j on Dr, we conclude that there exists some sequence 9 n f #* with 
||V , -r(*i) M — i/>i(tn, S*u)\\ < 1/n and hence 

lim ||^>j(t n ,0„u)|| = oo. (B.7) 



2G 



Applying Lemma IB. 41 as above, where initial time t = is shifted to t„, yields 



9n 



(/ (9n) (*n) + e B "< l "- ! 'C<°!( S ) ds^ /( fl »)(r). 



Corollary lB~3l implies that e B 7zO-t«) is ^-order preserving. That is, e B " CT- ' n) K!F C 
Hence, in view of (|B.6|l for 

5n b e B7x(r-*») (u + j\- B " a C m (s)ds^ + J\ B "^- a) C (0) (a)d8^ 

= e B « T (u + J\- B " a C W (s)ds^ . 

On the other hand, elementary operator norm inequalities yield 

||s„||>e-« B "» T ||/^(t n )l|-e 1|B " l|T T sup ||C (0) ( S )||. 

s6[0,r] 



Together with <|B.7|) . this implies \\g„\\ — > oo. From Lemma IB. 61 below we conclude that 
lim n f^ e (r) T y* = oo for some y* € Rip. Moreover, in view of Lemma lB.41 we know that 
/ (t) t j/* is increasing Therefore limsfg* f {r) T y* = oo. Applying (|B.6|l and Lemma [B,6l 
below again, this also implies that lima^* 11/ Mil — °°- It remains to set x* — (y*,0) and 
observe that bi £ R™ and thus 

4>{t,9u) = J ^iPj(t,9u) J ajj ipj(t,9u) + bj tpi(t,6u) + b}tpj(t,6u)J dt 

is uniformly bounded from below for all 9 £ [0,6*). Thus the lemma is proved under the 
premise that the matrices at are block-diagonal for all i = 1, . . . , m. 

The general case of admissible parameters a,ai,b,j3% is reduced to the preceding block- 
diagonal case by a linear transformation along the lines of Lemma 17.11 Indeed, define the 
invertible d x d-matrix A 

A= (lm 
\ D I n 

where the n x m-matrix D = (<5i, . . . , S m ) has i-th column vector 

A = if ^" >0 
[0, else. 

It is then not hard to see that A(R+ x R n ) = R+ x R", and 

<j>(t,u) = <f>(t, A T u), $(t,u) = (A T ) ip(t,A T u) 



satisfy the system of Riccati equations (|3.2|) with a, on, b, and B = (j3i, . . . , (3d) replaced by 
the admissible parameters 

5 = AaA T , ai=AQiA T , b = Ab, B = ABA' 1 . 

Moreover, a", are block-diagonal, for all i = l,...,m. By the first part of the proof, the 
corresponding maximal domain Z>r(t), and hence also X>r(t) = A t X>r(t), is star-shaped 
around zero. Moreover, if 6* < oo, then 



lim \\ibi(T,9u)\\ = lim 

816* 



and there exists some x* £ R!J? x R n such that 

lim <j)(T,9u) +tp(T,9u) T x* = lim 4>(t,9 (A t ) ' u\ + $ (r,9 (A T ) 1 u\ Ax* = oo. 
Hence the lemma is proved. □ 
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Lemma B.6. Let c £ R m , and (c n ) and (d n ) be sequences in R m such that 

c ■< c n ^ d n 

for all n e N. TTien t/ie following are equivalent 

(i) ||Cn|| — > OO 

(ii) cly* -> oo /or some y* G R+ \ {0}. 
In either case, \\d n \\ — > oo and d„j/* — > oo. 



Proof, (i) =>■ (ii) since ||c n || 2 = 2~Z^li( c ™ e *) 2 an d Cn e » > c T e;, we conclude that c^ei — > oo 
for some i = 1 m. 



(ii) => (i) this follows from ||c„y*|| < ||e n ||||j/* 



The last statement now follows since d n y* > c n y* . □ 



Finally, we sketch an alternative proof of Theorem 13.31 ( (i) I which avoids probabilistic 
arguments. 

Remark B.7. We may without loss of generality assume block-diagonal form of an, i = 
1, . . . , d (cf. the final part of the proof of Lemma |B.5[) . Assume, by contradiction, that for 
some v G R d , t + (u + iv) < t+(u). Then, as in the first proof, we may deduce the existence of 
t n T t+ ( u + iv) such that 

lim(»Vi(*~j u + iv)) + = oo. (B.8) 

n 

holds for some i £ {1, . . . , m}. Set g(t,u + iv) := $t(ipt, u + iv), h := Q(ip(t,u + iv). Then for 
i = 1, . . . , m the following differential inequality holds, 

g t {t,u + iv) = ia<,«(fli - hi) +gja*i,jjgj - hjai,jjhj + fij g (B.9) 

< ^ai.ugf + gjoti.jjgj + pj g 

and g(t = 0,u + iv)) = ip(t — 0,u) = u. Hence noting gj(t,u + iv) = ip.j(t,u) we obtain by 
Lemma lB~4l for all t € (0,t + (u + iv)) 

fHil>(t, u + iv) = g(t,u + iv) < ip(t, u). 

On the other hand, tpi(t, u) < M for some positive constant M € R+ , for all t £ [0, t+(u+iv)], 
hence lR.ipi{t,u + iv) < Mi, which contradicts ()B.8[) , 
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